ECHO CANCELLING/SUPPRESSION FOR HANDSETS 



Field Of The Invention 

The present invention relates generally to communications and in 
particular to an echo suppressor and method for suppressing echoes in a 
communication path. 

Background Of The Invention 

Acoustic coupling in telephone device handsets is common and occurs 
when signals broadcast by the speaker of a telephone device handset are coupled to 
the microphone of the same telephone device handset. This acoustic coupling results 
in echo signals being applied to the network over which the communication channel is 
established. The acoustic suppression of signals broadcast by the handset speaker 
before they are picked up by the handset microphone is variable and depends on a 
number of factors, such as the pressure exerted on the user's ear by the handset 
speaker and how well the handset speaker seals the ear. The loss in power of these 
signals as a result of acoustic suppression can vary from about -50db to as low as - 
24db when the telephone device handset is placed on a hard surface. If the delay in 
the network is short, such as for example 30ms, the normal acoustic suppression of 
signals broadcast by the handset speaker is generally sufficient to inhibit voice quality 
from being noticeably degraded by the echo signals picked up by the handset 
microphone. However, if the delay in the network is significant, echo signals applied 
to the network due to acoustic coupling will noticeably degrade voice quality over the 
communication channel. As such, suppressing echo signals resulting from acoustic 
coupling in telephone device handsets is important. 

Echo cancelers in telephone device handsets have been considered. A 
typical echo canceler attempts to model the transfer function of the echo signal path 
using a linear algorithm such as a Least-Mean-Squared (LMS) algorithm. The 
estimated echo signals generated by the echo canceler are subtracted from the echo 
signals picked up by the handset microphone. Differences between the estimated 
echo signals and the actual echo signals result in error signals, which are fed back to 
the echo canceler. Unfortunately, since the algorithm executed by the echo canceler is 
linear, the echo canceler cannot deal with non-linear effects and can only converge to 



-2- 

a transfer function which approximates the echo signals. As a result, residual echo 
error signals are applied to the network. 

It is therefore an object of the present invention to provide a novel 
echo suppressor and method for suppressing echoes in a communication path. 

5 

Summary Of The Invention 

According to one aspect of the present invention there is provided a 
method for suppressing echo signals generated in a communication path comprising 
the steps of: 

10 monitoring signals supplied to said communication path to determine 

an attribute thereof; and 

masking signals received from said communication path as a function 
of the determined attribute of said monitored signals. 

In a preferred embodiment, the attribute is the power level of the 
1 5 monitored signals. During the monitoring step, power level calculations are 

performed to determine the power level of the monitored signals. This is achieved by 
generating an envelope following the power level of the monitored signals. The 
envelope is generated by an infinite impulse response (IIR) lowpass filter. The IIR 
lowpass filter generates the envelope by solving the equation: 
20 AbsY-(l-alpha) AbsY + alpha* AbsY 0 

where alpha is a parameter of the IIR filter. 
An echo signal value is then calculated by solving the equation: 

Echo= AbsY/10 (A/20) 

where A is the minimum attenuation of echo signals in the communication path. The 
25 calculated echo signal value is used to select a mask to be combined with digitized 

signals received from the communication path. 

Preferably, the mask is a string of n-bits where n is a function of the 

echo signal value and wherein at least the most significant bits of the string have a 

zero value. In one embodiment, the bits of the mask are all zeros to basically achieve 
30 total echo suppression. In an alternative embodiment, the mask is leaky and at least 

the least significant bit of the mask has a one value. 
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According to another aspect of the present there is provided an echo 
suppressor to suppress echo signals generated in a communication path comprising: 

a power level calculator determining the power level of signals 
supplied to said communication path; and 
5 a mask generator responsive to said power level calculator and 

generating masks, said masks being generated as a function of the determined power 
level and being applied to the signals received from said communication path thereby 
to suppress echo signals received from said communication path. 

According to still yet another aspect of the present invention there is 
10 provided in a telephone device including a handset having a speaker to broadcast 

incoming signals and a microphone to receive outgoing signals, an echo suppressor to 
suppress echo signals picked up by the microphone as a result of acoustic coupling 
between said speaker and microphone comprising: 

a power level calculator determining the power level of incoming 
1 5 signals to be broadcast by said speaker; and 

a mask generator responsive to said power level calculator and 
generating masks, said masks being generated as a function of the determined power 
and being applied to said outgoing signals thereby to suppress echo signals. 

According to another aspect of the present invention there is provided 
20 an echo suppressor to suppress echo signals generated in a communication path 
comprising: 

an echo canceler in parallel with said communication path, said echo 
canceler having a transfer function approximating that of said communication path 
and generating estimated echo signals in response to signals supplied to said 
25 communication path, said echo canceler subtracting said estimated echo signals from 
signals received from said communication path to generate residual echo error 
signals; and 

a processor receiving said estimated echo signals and said residual 
echo error signals, said processor including a power level calculator to determine the 
30 power level of the estimated echo signals; and a mask generator responsive to the 
power level calculator and generating masks, said masks being generated as a 



function of the determined power level and being applied to said residual echo error 
signals thereby to suppress the same. 

The present invention provides advantages in that since echo signals 
are adaptively masked by the echo suppressor, echo suppression is achieved while 
5 maintaining voice quality. 

Brief Description Of The Drawings 

Embodiments of the present invention will now be described more 
fully with reference to the accompanying drawings in which: 
10 Figure 1 is a schematic diagram of a communication channel 

,g established between a pair of telephone devices over a network showing acoustic 

^ coupling in one of the telephone devices; 

CB Figure 2 is a schematic diagram of a telephone device including an 

echo suppressor in accordance with the present invention; 
1 5 Figure 3 is a graph showing an envelope calculated by the echo 

suppressor of Figure 2 following the power level of a signal to be broadcast by a 
telephone device handset speaker; 
i t- Figure 4 is a graph showing an echo signal and the mask generated by 

the echo suppressor of Figure 2 to "cover" the echo signal picked up by a telephone 1 
20 device microphone; and 

Figure 5 is a schematic diagram of a conventional echo suppressor and 
a non-linear processor to suppress residual echo signals in accordance with the present 
invention. 

25 Detailed Description Of The Preferred Embodiment 

Turning now to Figure 1, a communication channel established 
between a pair of telephone devices 12 and 14 over a network 16 is shown and is 
generally indicated to by reference numeral 10. As can be seen, when a 
communication channel is established between the telephone devices 12 and 14, 
30 acoustic signals 18 broadcast by the handset speaker 20 of receiving telephone device 
14 are acoustically coupled to the handset microphone 22 of the telephone device 14. 
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The echo signals picked up by the handset microphone 22 as a result of the acoustic 
coupling cause echoes in the network 16, which degrade voice quality. 
c_ ^ the defay in the network 16 is long, such as for example 150ms, 

which may be caused by voice packetization and local area network (LAN) 
5 propogation^delays, echoes in the network 16 as a result of acoustic coupling become 
audile detracting from voice quality. 

To suppress adaptively echo signals picked up by the handset 
microphone 22 as a result of acoustic coupling, an echo suppressor 32 in accordance 
with the present invention is provided in each of the telephone devices 12 and 14 

10 respectively. Figure 2 better illustrates telephone device 14. As can be seen, 

telephone device 14 includes a line receiver 40 coupled between the network 16 and 
an analog to digital converter 42. Analog to digital converter 42 provides digital 
output to the echo suppressor 32 and to a digital to analog converter 44. The digital to 
analog converter 44 conveys its analog output to a speaker driver 46, which amplifies 

15 the analog signals before they are broadcast by the handset speaker 20. 

Coupled to the handset microphone 22 is a microphone receiver 48, 
which provides output to an analog to digital converter 50. Analog to digital 
converter 50 provides digital output to the echo suppressor 32. Echo suppressor 32 in 
turn supplies output to a digital to analog converter 52 that is coupled to the network 

20 16 via a line transmitter 54. 

As can be seen, echo suppressor 32 couples the handset speaker 20 and 
the handset microphone 22 of the telephone device 14. The echo suppressor 32 
adaptively masks echo signals picked up by the handset microphone 22 to inhibit echo 
in the network 16. The adaptive masking performed by the echo suppressor 32 is 

25 based on the power level of signals to be broadcast by the handset speaker 20. This is 
due to the fact that typically, the larger the signals broadcast by the handset speaker 
20, the larger the echo signals picked up by the handset microphone 22 as a result of 
acoustic coupling will be. 

In the present embodiment, the echo suppressor 32 is embodied in a 

30 digital signal processor executing an echo suppression algorithm. The echo 

suppression algorithm performs a power level calculation 60 to determine the power 
level of signals received by the telephone device 14 to be broadcast by the handset 



speaker 20 and uses the determined power level to generate masks. The masks are 
subtracted from signals received by the handset microphone 22 via a multiplier 62 to 
mask echo signals picked up by the handset microphone 22. 

Appendix A shows psuedo-code representing the echo suppression 
algorithm executed by the echo suppressor 32. The echo suppression algorithm, in 
response to signals to be broadcast by the handset speaker 20, invokes a power level 
calculation routine (see Appendix B). During execution of this routine, an envelope 
following the power level of signals to be broadcast by the handset speaker is 
generated using an infinite impulse response (IIR) lowpass filter. The IIR filter 
generates the envelope by estimating the long-range average of the absolute value of 
the signal to be broadcast and is of the form: 

AbsY = (l-alpha)AbsY + alpha* AbsYO (1) 
Alpha is an IIR filter parameter and is chosen to provide a fast attack time and a slow 
decay time for the IIR filter. In the present embodiment, two different values for 
alpha are used, namely alpha_fast and alpha_slow depending on the power level of 
the signal to be broadcast by the handset speaker 20. Figure 3 shows an example of 
an envelope 64 generated by the echo suppressor 32 in response to a signal to be 
broadcast by the handset speaker where alpha_fast = 1 and alpha_slow = 2" 12 . As will 
be appreciated, by choosing these values for alpha, the echo suppressor generates an 
envelope that reacts fast to signals to be broadcast by the handset speaker 20. The 
slow decay time on the other hand compensates for small signal delays and reduces 
the switching effect when the signals fade. 

As the envelope is generated, the echo suppressor 32 invokes a mask 
selection routine to calculate the maximum expected value of the echo signal based on 
the envelope. As stated earlier, the echo signal received by the handset microphone 
22 is an attenuated copy of the signal broadcast by the handset speaker 20. The 
maximum expected value of the echo signal is calculated by solving the equation: 

Echo = AbsY/10 (A/20) (2) 
where A is the minimum attenuation or acoustic suppression of echo signals. 

Thus, for example in a case where signals broadcast by the handset 
speaker 20 undergo a minimum attenuation of -24dB before being picked up as echo 
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signals by the handset microphone 22, the maximum expected value of the echo 
signal according to equation (2) is equal to AbsY/1 5.8489. 

After the maximum expected value of the echo signal has been 
calculated, the mask selection routine selects the mask to be combined with the echo 
5 signal in accordance with the routine illustrated in Appendix B. In the present 

embodiment, the mask combined with the echo signal takes the form of a string of 
zeros n-bits long, where n is a function of the echo signal value determined at 
equation (2). The maximum value for n is determined by the maximum output value 
of the analog to digital converter 50, which in the present example is 8192. Solving 
10 equation (2) using this value for AbsY yields 517 which in binary format is 
i;3 1000000100. As a result, a mask having ten zeros (i.e. n = 10) is required to mask 

:, 2 this binary value. 

i!0 For example, if the power level of the signal to be broadcast by the 

handset speaker 20 is equal to 1379 after solving equation (1) to determine AbsY, by 
;'*7 1 5 solving equation (2) the expected echo signal level equals 87 (assuming A = -24 dB) 

a which in binary format is 1010111. In accordance with the mask selection routine, a 

J mask having seven zeros is chosen and is combined with the signals received by the 

jjjfj handset microphone 22 and digitized by the analog to digital converter 50. 

5 Figure 4 shows an echo signal 66 in a telephone device having a 

20 minimum acoustic attenuation or suppression equal to -24dB together with the mask 
68 selected by the echo suppressor 32. In the first half of the graph, it can be seen that 
the mask 68 completely covers the echo signal. In the second half of the graph, voice 
signals picked up by the handset microphone 22 superimposed on the echo signal are 
shown. As will be appreciated, the mask is orders of magnitude smaller than the 
25 voice signals. As a result, the mask causes only a minimum loss of speech quality. 

As will be appreciated, the echo suppressor 32, by selecting masks 
having the appropriate number of zeros, adaptively masks echo signals to achieve 
basically total echo suppression. 

If desired, "leaky" masks can be used to mask echo signals to inhibit 
30 noticeable switching, which may occur during total echo suppression in the presence 
of high background noise. "Leaky" refers to a mask having at least one least 
significant bit (LSB) with a "one" value. Generally, the number of LSBs having 




-8- 



"one" values is chosen depending on the number of bits in the selected mask. For 
example, if an 8-bit mask is generated the three LSBs of the mask can be leaked (i.e. 
have "one" values). If a 7-bit mask is generated, the two LSBs of the mask can be 
leaked. As will be appreciated, by leaking some of the background noise, switching is 
5 reduced. 

Turning now to Figure 5, an alternative embodiment of the present 
invention is shown. In this embodiment, a conventional echo canceler 70 is in parallel 
with a communication path 71 in which echo signals are generated. The 
communication path 71 may be a telephone device handset as described in the 
10 previous embodiment. Alternatively, the communication path 71 may be a handsfree 
telephone, network which causes network reflections or other source of echo signals. 
2 The echo canceler 70 attempts to model the transfer function of the echo signal path 

D using a Least-Mean-Squared (LMS) algorithm so that the echo canceler generates 

'jfj 

1 estimated echo signals that are the same as the echo signals received from 

^7 15 communication path 71. The estimated echo signals are subtracted 72 from the actual 

echo signals in an attempt to cancel the echo signals. The amount of cancellation is 
J commonly referred to as ERLE. Differences between the estimated echo signals and 

j=j the actual echo signals result in error signals. The error signals are fed back to the 

2 echo canceler 70 so that the echo canceler can attempt to converge to the correct 
■~ 20 transfer function. 

Unfortunately, the LMS algorithm only models linear effects in the 
echo path and does not deal with non-linear effects caused by for example, clipping, 
telephone key rattling, and frequency shifts. Therefore, the echo canceler converges 
to a transfer function that approximates the correct transfer function resulting in non- 
25 zero error signals. This causes residual echo signals. 

To suppress the residual echo signals, a non-linear processor (NLP) 80 
is provided and receives the estimated echo signals output by the echo canceler 70 as 
well as the residual error signals output by the subtractor 72. The NLP 80 executes an 
echo suppression algorithm similar to that executed by echo suppressor 32 to 
30 determine the power level of the estimated echo signals output by echo canceler 70 

and to generate masks based on the determined power level. The masks are combined 
with the error signals to suppress the residual echo signals. 



For example, assuming the^echo canceler 70 will achieve an ERLE 
equal to 18db, it can be expected thaHne residual echo signal will be eight tines 
smaller than the estimated ech^^gnal. During execution of the echo suppression 
algorithm, an envelope equ^l to 13798 is generated by solving equation (1). The 
expected echo sign^lfas a value equal to (1379/8)=173 after solving equation (2). 
The closest pow#r of two mask is OxFFDOO. Assuming it is only necessary to 
improve the^RLE with 12db to achieve a total ERLE equal to 30db, the least 
significant! bits can be leaked yielding a mask equal to OxFFOF. 

Although the echo suppressor 32 is shown as being incorporated in a 
telephone device to suppress echo signals generated as a result of acoustic coupling in 
the telephone device handset, those of skill in the art will appreciate that the echo 
suppressor and non-linear processor may be used to suppress echo signals generated 
in virtually any communication path in which echo signals are generated. 

In addition, although preferred embodiments of the present invention 
have been described, those of skill in the art will appreciate that variations and 
modifications may be made without departing from the spirit an scope thereof as 
defined by the appended claims. 



